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. Abstract. The nondeterministic quantum finite automaton (NQFA) is the only known case where a 

Pj^ ■ one-way quantum finite automaton (QFA) model has been shown to be strictly superior in terms of 

language recognition power to its probabilistic counterpart. We give a characterization of the class of 
languages recognized by NQFA's, demonstrating that it is equal to the class of exclusive stochastic 
languages. We also characterize the class of languages that are recognized necessarily by two-sided error 
by QFA's. It is shown that these classes remain the same when the QFA's used in their definitions 
• are replaced by several different model variants that have appeared in the literature. We prove several 

' closure properties of the related classes. The ramifications of these results about classical and quantum 

• , sublogarithmic space complexity classes are examined. 

1 Introduction 

> 

An interesting feature of both probabilistic and quantum computational models is that in some 
cases, the set of problems that can be solved gets larger when the automaton in question is allowed 
, to make more error in its decisions, whereas in some other cases, such a relaxation does not increase 

the computational power at all. When one-way probabilistic finite automata (PFA's) are required 
■ to make no error in their decisions, they recognize exactly the class of regular languages. When 

they are allowed to make bounded error, that is, to give the correct response for each input with 
probability at least ^ + 5, for a fixed 5 > 0, the class of languages that are recognized remains 
the same. The computational power of PFA's is seen to increase only when we allow two-sided 
^ . unbounded error, where the only requirement is that all members of the recognized language are 

accepted with probability greater than the acceptance probability of any nonmember. 

There are several alternative models of quantum finite automata (QFA's), and differences (e.g. 
in features regarding the form and number of measurements that can be performed on the machine, 
whether an output tape is included or not, etc.) in their definitions, which would not affect the power 
of classical automata, yield QFA variants of differing computational power. In the zero-error and 
bounded-error cases, some QFA variants [28 20 30J are strictly inferior to the corresponding PFA's 
from the point of view of language recognition power, whereas the most general models |31|11|7|17] 
are equivalent to their probabilistic counterparts in those settings. In the unbounded error case, the 
languages recognized with cutpoint by the weakest QFA model [28] form a proper subclass [6] of the 
corresponding classical class (the stochastic languages), whereas it was discovered recently |42|43| 
that more generalized QFA variants, including the popular Kondacs-Watrous model, are equivalent 
to PFA's in this case as well. 
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With regard to state complexity, sufficiently general QFA models can simulate all zero-error 
and bounded-error PFA's with small overhead, and some regular languages have bounded-error 
QFA's that are exponentially smaller than the corresponding PFA [3j . In the two-sided unbounded 
error setting, quantum and probabilistic machines can simulate each other with only a polynomial 
overhead in the number of states [43| , 

We study the computational power of QFA's in the one-sided unbounded error setting, where 
one of the two responses that the machine can output about the membership of the input string 
in the recognized language is correct with certainty, and the other response has a nonzero proba- 
bility of being correct. Since the error bound can be improved by repeating the computation, an 
examination of languages recognizable in this setting is significant for understanding the power of 
generalizations of the underlying model to include, say, a two-way tape head. Just like their classical 
counterparts, QFA's that recognize their languages with cutpoint 0, (that is, with positive one-sided 
error,) are also known as nondeterministic machines. It is well known that classical nondeterministic 
finite automata recognize precisely the regular languages. In notable previous work on nondeter- 
ministic quantum finite automata (NQFA's), Bertoni and Carpentieri have shown [6j that the class 
of languages recognized by NQFA's of the Moore-Crutchfield type does not contain any nonempty 
finite languages, but does contain the nonregular language L neg = {w G {a, b}* \ \w\ a / Mb}, 
where l^l^ denotes the number of occurrences of the symbol a in the string w. Nakanishi et al. 
|29] considered the somewhat more powerful Kondacs-Watrous model of QFA's, and proved that 
NQFA's of this type can recognize all regular languages, establishing their strict superiority over 
their classical counterparts. 

In this paper, we give a full characterization of the class of languages recognized by all NQFA 
variants that are at least as general as the Kondacs-Watrous type, demonstrating that it is equal 
to the class of exclusive stochastic languages. This lies properly between the classes of languages 
recognized with zero error and two-sided unbounded error by QFA'fl Every regular language has 
a NQFA with at most linearly more states than the corresponding classical nondeterministic finite 
automaton (NFA), and there exist infinite families of regular languages which can be recognized by 
just tuning the transition amplitudes of a NQFA with a constant number of states, whereas the sizes 
of the corresponding NFA's grow without bound. We also prove several new closure properties of the 
related classes, and examine what these results imply about the comparative power of probabilistic 
vs. quantum Turing machines with small space bounds. 

The rest of this paper is structured as follows: Section 2 contains the relevant definitions and 
previously known facts. In Section 3, we give a characterization of the class of languages recognized 
by Kondacs-Watrous NQFA's, and discuss the superiority of several NQFA variants over their 
classical counterparts in terms of language recognition and succinctness. An examination of the 
relationships among languages which can be recognized by QFA's with one-sided error and those 
that require two-sided error is presented in Section 4. Section 5 contains several proofs of closure 
properties for the classes of languages recognized with one-sided error. Section 6 is a conclusion, 
where we examine the consequences of the NQFA results for classical and quantum sublogarithmic 
space complexity classes. 



1 To our knowledge, this is the only case where these three classes have been shown to be distinct for any automaton 
model, be it quantum or classical. 



2 Preliminaries 



2.1 Automata 

In the following, E denotes the input alphabet, not containing the end-markers <£ and $, and r is 
the tape alphabet, such that r = £ U {(t, $}. 

Definition 1. A (1-way) probabilistic finite automaton (PFA) with n € Z + states is a 4-tuple 
V = (S,U,{A aer },F), where 

1. S = {si, • • • , s n } is the set of states, and si is the start state, 

2. A a is the nxn real-valued stochastic transition matrix for symbol a, that is, A a (i, j) is the value 
of the transition probability from state Si to state Sj when reading symbol a, 

3. F C S is the set of accepting states. 

The probability distribution of "P's states at any point during the processing of the input 
string can be traced using an n-element row vector. For an input string w S U*, w = Qw$, 
vo = (1, 0, • • • , 0)ixn denotes the initial state vector. The effect of reading the ith. tape symbol can 
be calculated by multiplying the vector v^_i by the matrix A Wi , yielding Vj. V| w | = V()A Wl • • • A W|w| 
denotes the final state vector. The acceptance probability of w by V is 

fviw) = Yl v |w|00> (!) 
where V| w |(i) denotes the ith entry of V| w |. 

Definition 2. A generalized probabilistic finite automaton (GPFA) with n € Z + states is a 5-tuple 
G = (S, S, {A CTeI ;},vo,f), where 

1. S = {si, • • • , s n } is the set of states, 

2. A a is the nxn real-valued transition matrix for symbol a, that is, A a (i,j) is the (possibly 
negative) "weight" of the transition from state Si to state Sj when reading symbol a, 

3. vo is the real-valued initial lxn vector, and, 
4- f is the real-valued final n x 1 vector. 

A GPFA Q is associated by a function fg:£*—> M, in the following way: For an input string 

w e U*, 

/sO) = v A Wl • • • A w f. (2) 



Definition 3. A (1-way) Kondacs-Watrous quantum finite automaton (KWQFA) 120^ with n £ Z + 
states is a 5-tuple M. = (Q, U,{\J a£ r},Qacc,Qrej), where 

1. Q = {gi, • • • q n } is the set of states, and q\ is the initial state, 

2. U ^ is the nxn complex-valued unitary transition matrix for symbol a, that is, U<j(j, i) is the 
amplitude of the transition from qi to qj when reading the symbol a, 



3- Qacc an d Qreji disjoint subsets of Q, are the sets of accepting and rejecting states, and Q non = 
Q \ (Qacc U Qrej) is the set of non-halting states. 

The amplitude distribution of the states of a quantum automaton is represented by an n- 
element column vector. | uo) , the initial state vector, equals (1,0, ••• ,0)\ xn . Note the difference 
with probabilistic automata. 

For a given input string w € E* , Ai scans the tape, containing w = $10$, from the left to the 
right. During the processing of each symbol, the machine undergoes two operations: First, its state 
vector evolves according to the unitary transformation associated with the scanned symbol, that is, 

\ui) = U^Jiti-i). 

Then, the machine is observed to see whether it has accepted, rejected, or not halted yet. At 
this point, each accepting state with amplitude a adds \a\ 2 to the overall acceptance probability 
fM( w ) °f the inputd. In the particular KWQFA's that will be described in this paper, intermediate 
observations can yield the result "reject", but the accepting states can be entered only at the end 
of the computation, after scanning the right end-marker $. Halting states "drop out" of the state 
vector \u)i, their amplitudes being replaced with zeros, and the head moves on to the next symbol. 

As we have defined them, PFA's process all of the input string before deciding on acceptance or 
rejection, whereas KWQFA's can halt before reaching the end of the input. (The QFA variant that 
precisely corresponds to Definition Q] is the Moore- Crutchfield QFA (MCQFA) [28J.) This difference 
should not distract the reader, since it is easy to show that the classes of languages recognized by 
PFA's, both with general cutpoint, and with cutpoint 0, (to be defined in the next subsection,) do 
not change when the model is modified to give it this additional capability. This is true for all 
PFA variants that may be obtained by appropriately reconfiguring Definition Q] to correspond to the 
various QFA models that are cited in this paper. The only crucial distinction between Definitions [T] 
and [3] is the one between classical and quantum. 

2.2 Languages 

Definition 4. An automaton A defined over alphabet E divides E* into three disjoint subsets with 
cutpoint A6l: 

1. L(A, < A) = {w e E* I f A (w) < A}, 

2. h(A, = A) = {w e E* I f A {w) = A}, 

3. h(A, > A) = {w G E* I f A (w) > A}. 

Additionally, we define h(A, ^ A) = h(A, < A) U h(A, > A). 

Definition 5. The pair (A, A) is equivalent under cutpoint separation to the pair (A' , A'), denoted 
as (A, A) = (A', A'), if 

h(A,< A) =L(A\< A') 
h(A,= A) = L(X= A') 
h(A,> A) = L(«4.', > A'), 

2 This is the behavior allowed by the general KWQFA definition. 

3 Note that, when comparing two QFA variants with each other, this kind of difference is very important, since it 
usually affects the computational power of the models. 



where A, A are automata and A, A' € M are outpoints. 

Definition 6. T/ie language recognized by automaton A with outpoint A € K is defined as 

h(A, A) = L(„4,> A). 

L(.4., A) is said to be recognized by automaton A with one-sided cutpoint AgR ifh(A, < A) = 0. 
Definition 7. JM/ 

1. The languages recognized by PFA's with cutpoint A € [0,1) constitute the class of stochastic 
languages (S > ). The collection of languages whose complements are stochastic is the class co- 
S>. 

2. Languages of the form L("P, = A), for any PFA V , and any A € [0, 1], constitute the class 5^. 

3. Languages of the form I^T 3 , 7^ A), for any PFA V, and any A € [0,1], constitute the class 
exclusive stochastic languages (S^). 



Remark 1. In the study of complexity classes defined in terms of Turing machines, "recognition with 
cutpoint" is used synonymously with "unbounded-error recognition". This usage does not provide 
an appropriate coverage of the intuitive concept of unbounded-error computation that we described 
in Section Q] in the case of PFA's: Given a PFA V which recognizes a language L with cutpoint, one 
can build a new PFA V for the complement of L by just switching the statuses of the accepting 
and non-accepting states of V . Since V accepts any member of L with greater probability than any 
nonmember, we say that it recognizes L with unbounded error. However, since S > is not known to 
be closed under complementation, we do not know in general whether L is stochastic or not. For 
this reason, we take S 5 * U co-S > to be the class of languages recognized with unbounded error by 
PFA's 0. 



Definition 8. 

1. The languages recognized by KWQFA's (MCQFA's) with cutpoint A € [0, 1) constitute the class 
QL (MCL). 

2. The languages recognized by KWQFA's (MCQFA's) with cutpoint 0, i.e., those of the form 
h(M,0), for any KWQFA (MCQFA) M, constitute the class NQL (NMCL). 

As mentioned before, nondeterministic computation corresponds to recognition with cutpoint 0, 
and so NMCL and NQL denote the classes of languages recognized by nondeterministic MCQFA's 
and KWQFA's, respectively. 



4 Note that S^ at , the class of languages recognized with cutpoint \ by PFA's whose transition matrices contain only 
rational numbers, is known |37] to be closed under complementation. It is however customary to define PFA's and 
QFA's with general transition probabilities/amplitudes, as we did in Definitions [1] and [3] in the finite automata 
literature, and we follow this convention. See Section [6] for more on this point. 



Fact 21. [36] Let G\ be a GPFA and Ai G R be a cutpoint. For any cutpoint A2 G M, there exists 
a GPFA g 2 such that Ai) = (G 2 , A 2 ). 



Fact 22. [32] Let Pi be a PFA and Ai G [0, 1) be a cutpoint. For any cutpoint A2 G (0, 1), there 
exists a PFA V 2 such that (Pi, Ai) = (V 2 , A 2 ). 



Fact 23. @3] For any PFA 7>, there exists a KWQFA A4 such that {V, \) = (M, \). 

Fact 24. [23|43j For any KWQFA M and cutpoint A G [0,1), there exists a GPFA £ such that 

(M,\) = (g,\). 



Fact 25. [36] For any GPFA Q and cutpoint Ai G R, there exist a PFA P and a cutpoint A 2 G (0, 1) 
such that (<7,Ai) = {V,X 2 ). 



Fact 26. [6] MCL C S>. 



Fact 27. [8] Any MCQFA with n states can be simulated by a KWQFA with 2n states, so MCL 
C QL and NMCL C NQL. 

Fact 28. [35] The class of regular languages is a proper subset of both and S = . 
Fact 29. [32] C S> and S>\ S= ± 0. 



Fact 210. [9|29j The class of regular languages is a proper subset of NQL. 

By Facts I2"3l25l QL = S > , PFA's and KWQFA's have the same language recognition power with 
general cutpoint and with unbounded error (Remark [1]). It has in fact been shown [32] that all 
one-way QFA models |31|11|7|T5] that generalize the KWQFA are also equivalent to the PFA in 
this regard. (See Subsection 13.21 for more on this.) 

We are interested in the case of one-sided unbounded error, where one of the two responses 
that the machine can output about the membership of the input string in the recognized language 
is correct with certainty, and the other response has a nonzero probability of being correct. We 
say that such an automaton has positive one-sided error if it rejects non- members of its language 
with certainty. This corresponds to recognition with cutpoint 0. The opposite case is called negative 
one-sided error, where the language in question is of the form L(.A, 7^ 1), recalling that, when A is 
a PFA or a QFA, f A has range [0, 1]. 

PFA's can recognize all and only the regular languages with cutpoint [26]. KWQFA's can do 
more than that, as will be characterized in the next section. 



3 Languages Recognized with One-sided Error 

We start the presentation of our main result by stating a fact which will be useful in several proofs 
in the paper. 

Lemma 1. For any language L, L € if and only if there exists a GPFA that recognizes L with 
one-sided cutpoint 0. 

Proof. The forward direction is proven on page 171 of [32]. In the reverse direction, if a GPFA 
recognizes L with one-sided cutpoint 0, then L € by Fact [25j □ 

3.1 A characterization of NQL 



Lemma 2. C NQL. 

Proof. If L e S^, then there exists an ra-state PFA V = (S, £, {A CTer }, F) such that L = L("P, / \) 
We define S' , v , and {A^ gr } as follows: 

1. S = S U {s n+1 ,s n+2 , s n+3 }; 

2. v = (1, 0, ■ ■ ■ , 0) is a 1 x (n + 3)-dimensional row vector; 

3. Each A a is a (n + 3) x (n + 3)-dimensional matrix: 
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computation "imitates" the processing of w by V; the first n entries of the manipulated vector v 
contain exactly the state vector of V (multiplied by ^) in the corresponding steps of its execution. 
The last matrix multiplication results in 



M 



lXTl 







The (n + l) st entry of v.. equals if and only if w ^ L. 

Using a modified version of the PFA simulation method described in |43j . we can construct a 
KWQFA M. = (Q, U, {Uo-gr}, Qacc, Qrej) recognizing L with cutpoint 0: For each a G r, d a is built 
according to the template 



c„A, 



C„B n 



\ 



J 



where B a = [bij] is a lower triangular matrix, and C a = [cj -■] is a diagonal matrix. The entries of 
Uo- are computed iteratively using the following procedure: 

1. The entries of B a and C a are set to 0. 

2. The entries of B a are updated to make the rows of ( | B a ) pairwise orthogonal. Specifically, 



for i = 1 • • • n + 2 

set bij = 1 

for j = i + 1 ■ ■ ■ n + 3 

set bj : i to some value so that the ith and jth rows become orthogonal 

set Imax to the maximum of the lengths (norms) of the rows of ^A CT | B a 

3. The diagonal entries of C a are updated to make the length of each row of ^A^. | B a \ C^j equal 
to l max . Specifically, 

for i = 1 • • • n + 3 

set U to the current length of the zth row of ^A^. | B^ 

set c; i to \ l max — If 



4. Set c CT to 



1 



5. The entries of D a are selected to make U£ a unitary matrix. The transpose accounts for the 
difference between the probabilistic and quantum vector notations. 

The state set Q = Q non U Qacc U Qrej is specified as: 

1. q n +\ € Qacc corresponds to state s n +i; 

2. € Qrej corresponds to state s n+ 2] 

3- • • • , <JW, 9n+3} € Qnon correspond to the remaining states of S* , where c/i is the start state; 
4. All the new states that are defined during the construction of {Uo-er} are rejecting ones. 

A4 simulates the computation of V for a given input string w € U* by representing the probability 
of each state Sj by the amplitude of the corresponding state qj\ specifically, this amplitude equals 

c <t frii=i ^.j x v '(j) immediately after the (k + l) st step of the computation [13], where k < \w\. 
The transitions from the 2re + 6 states added during the construction of Uo-er for ensuring unitarity 



do not interfere with this simulation, since the computation halts immediately on the "branches : 
where these states are entered. Therefore, the top n + 3 entries of the state vector of M equal 




lXTi 



2/pM ~ 1 3-2/pM c 



just before the last measurement on the right end-marker. Since the amplitude of the only accepting 
state is nonzero if and only if w € L, L is recognized by A4 with cutpoint 0. □ 



Lemma 3. NQLQ . 

Proof. By Fact [2^1 there exists a GPFA with one-sided cutpoint for any member L of NQL. By 
Lemma [TJ L is an exclusive stochastic language. □ 



Theorem 1. = NQL. 



Corollary 1. S~ is precisely the class of languages that can be recognized with negative one-sided 
error by KWQFA 's. 



The superiority of KWQFA's over PFA's in the one-sided error setting now follows from Fact 
By Fact \29\ there exist languages that KWQFA's can recognize with two-sided, but not one-sided 
error. The class of these languages is precisely (S > U co-S > ) \ (S 7 ^ U S = ) (Remark 2.1). Note that the 
above results also establish that the class of languages recognized by NQFA's is not closed under 
complementation (Fact [52]) . 



3.2 More general QFA models 

Several one-way QFA models (like |30|31|11|7|18| . and the one-way version of the machines of [3],) 
that generalize the KWQFA have appeared in the literature. In the bounded-error case, some of 
these generalized machines recognize more languages than the KWQFA. We claim that the classes 
of languages recognized by the nondeterministic versions of all these automata are identical to each 
other, and they coincide with NQL. 

We demonstrate this fact for one of the most general models, namely, the quantum finite au- 
tomaton with ancilla qubits (QFA-A) [3Tj, which can simulate all known one-way QFA models. Let 
us give the name QFA-Ao to the class of languages recognized with cutpoint by QFA-A's. For any 
QFA-A A4, there exists a GPFA that computes exactly the same acceptance probability function 
as M [35]) so QFA-Ao Q by Lemma HJ Since any KWQFA can be simulated by a QFA-A, NQL 
C QFA-Aq . Therefore, QFA-A = NQL= S*. 



3.3 Space efficiency of QFA's with cutpoint 



ft is well known |3|27j that some infinite families of languages can be recognized with one-sided 
bounded error by just tuning the transition amplitudes of a QFA with a constant number of states, 
whereas the sizes of the corresponding PFA's grow without bound. After a simple example, we will 
argue that this advantage is also valid in the unbounded error case. 

Definition 9. For m € Z + , L m C {a}* is defined as 

L m = {a 1 | i mod (m) ^ 0}. 

Theorem 2. For m > 1, L m can be recognized by a 2- state mcqfM with cutpoint 0. 

Proof. A4 begins the computation at state qo, and each transition with the symbol a corresponds 
to a rotatioro by angle ^ in the |<7o)-|<Zi) plane, where q\ is the accepting state. □ 

For any positive n, it is known [26] that every n-state PFA with cutpoint has an equivalent 
nondeterministic finite automaton with the same number of states. Therefore, only finitely many 
distinct languages can be recognized with one-sided unbounded error by PFA's with at most n 
states. 

Combining this with the fact that any n-state PFA with cutpoint can be simulated by a 
KWQFA with In + 4 states using a simple adaptation of the technique of [43], the superiority of 
QFA's over PFA's in this regard is established^- 

4 S^, S = , and Languages Recognized with Two-sided Error 

To gain a better understanding of the classes of languages recognizable by positive one-sided, neg- 
ative one-sided, and necessarily two-sided error by QFA's, we examine some examples from each of 
those families. Bertoni and Carpentieri [6] showed that L neq is in NMCL, and that its complement, 
say, = {w € {a, b}* \ \w\ a = \w\b}, is not in MCL. Now that we have TheoremCQ we can use the 
well-known results [32j?] from the PFA literature that state that L eq € S = , L neq € S^, but not vice 
versa, to conclude that stronger QFA variants also can not recognize L eq with positive one-sided 
error, and neither can they recognize L neq with negative one-sided error. Similarly, Lace et al. [21j 
proved recently that the complement of the palindrome language L pa i = {w € {a, b}* \ w = w r } is 
in NQL. We can show the corresponding result for L pa i using the following fact: 

Fact 41. p3] Let L € S = . Then there exists a natural number n > 1, such that for any strings 

u,v,y G E*, 

if uv, uyv, • • • , uy n ~ l v € L, then uy*v C L. 

Theorem 3. L pal £ . 

5 There is an equivalent 4-state KWQFA. 

6 For details of a similar construction for a nonregular language, see [6]. 

7 Note that a QFA-A can realize this simulation with just n states. 



Proof. Suppose that L pa i G S^. Then L pa i G S . Let u = a n b, y = a, and v = e. 



a n b, a n ba, ■■■ , a n ba n 1 G L pai 



imply that a n ba n G L pa i by Fact [HI Since this string is actually a member of L pa i, we have a 
contradiction. □ 

We will now exhibit some languages which can only be recognized by two-sided error by a QFA. 

Theorem 4. L = {cra?i U biu 2 \ m G L eq ,w 2 G L ne q} G 5 > \ (S^U S^). 
Proof. Suppose that L G S^, then there exists a GPFA 

£ = (S, S, {A CTe{ai6} },v ,f) 
recognizing L with one-sided cutpoint 0. The GPFA 

O' = (5 , ,i: ) {A . e { a)6 }} ) voA 0) f) 

recognizes L eq with one-sided cutpoint 0, meaning that L eq G S^. This contradicts the well-known 
fact mentioned in the first paragraph of this section. Suppose now that L G S = , then 

L = {eU au>2 U bw\ \ w\ G L eq , w 2 G L neq } 

is in S^, which also results in a contradiction for the same reason. Since both L eq and its complement 
are stochastic, it is not difficult to show that L is stochastic. □ 

Lemma 4. L it = {w G {a, b}* \ \w\ a < \w\ b } £ (S^U S^). 
Proof. Suppose that Lit G S = . Let u = e, y = a, and v = b n . 

b n ,ab n ,--- ,a n ~ 1 b n G Lu 

imply that a n b n G Lit by Fact 14 1[ Since this string is actually a member of Lit, we have a contra- 
diction. 

Similarly, suppose that Lu G S^, or Lit € S = . Let u = a n , y = b, and v = b. 

a n b,a n b 2 ,--- ,a n b n £Lj t 

imply that a n b n+1 G Lu by Fact HTJ Since this string is actually a member of Li t , we have a 
contradiction. □ 



Corollary 2. L u G S> \ (5=U S*). 

Proof. This follows from Lemma [J] and the fact that Lu G S 1 * [33|19j . 



□ 



Theorem 5. L eq . b = L eq ■ b + G S> \ (5=U 

Proof. The proof of L eq . b $ (S=U S*) uses the setup presented in Lemma 01 i.e., 

1. select u = e, y = a, and v = b n to contradict with L eq .f, G S = , 

2. select u = a n , y = b, and v = b to contradict with L eq . b G S^. 

Any string w is a member of -L e(3 .fe if and only if it has the following three properties: 

— w ends with b. 
-we L it . 

— Let u be the longest prefix of w ending with a (u = e if w G {fr*})- Then, u G L^. 

Since these properties can be checked easily by a two-way PFA with bounded error, L eq . b G S > 
[33TT9] . □ 

We conclude this section by showing the stochasticity of an important family of languages. 

Definition 10. \2J$ The word problem for a group is the problem of deciding whether or not a 
product of group elements is equal to the identity element. 



Definition 11. Let Q k = (G,o) be a finitely generated free group with a basis 

E = {(7l, . . . ,<Tk>Vl ; • • • l°~k }' 

where k G Z + is the rank of Q k . L wp (Q k ) C U* is the language defined as 

L wp {Q k ) = {w = wi ■ ■ ■ w\ w \ | Wi G U, 1 < i < \w\, w\ o ■ ■ ■ o t0| w | = i}, 
where % G G is the identity element of Q k . 



Fact 42. (Page 1 of [25]) Let g kl and g k2 be finitely generated free groups. Then Q kl and Q\ 2 are 
isomorphic if and only if k\ =ki- 



Corollary 3. L wp (Q kl ) and L wp (Q2 2 ) are isomorphic if and only if k\ = k<i, where Q^ 1 and Q^ 2 are 
finitely generated free groups. 

As a generic name, L wp (k) can be used instead of L wp (Q k ) due to Corollary [3l where k G Z + . 
Fact 43. [39] L wp (l) G S>. 



Fact 44. [9] L wp (k) G co-NMCL, the class of languages whose complements are in NMCL, for any 
k G Z+. 



Corollary 4. L wp {k) € for any k € Z 



We will now provide a proof of the following theorem. 

Theorem 6. L wp (k) € ifP for any k > 2. 

In fact, Theorem [6] was stated as a corollary on page 1463 of [9], but the purported proof there 
was based on the claim that co-NMCLCMCLC S > . It is however known [6], as we mentioned above, 
that a member of co-NMCL (L eq ) lies outside MCL. Furthermore, the same demonstration can be 
easily extended to L wp (k), where k € Z + . 

Corollary 5. L wp {k) MCL for any k € Z+. 

Since it is still an open problem whether S = C S > or not, we cannot use Corollary [4] directly to 
prove Theorem [6J Instead, we will focus on a subclass of S = that is known to be a subset of S > . 

Definition 12. \37$ S^ at is the class of the languages of the form L(£/, = A), where Q is a rational 
GPFA, (i.e. one whose transition matrices and initial and final vectors contain only rational num- 
bers,) and X is a rational number. Additionally, Sf at is the class of languages whose complements 

Fact 45. [37] S= t C S>. 

Definition 13. SOs(Q) is the group of rotations on R 3 that are 3x3 dimensional orthogonal 
matrices having only rational entries with determinant +1. 

Fact 46. |34|13j For any k > 2, SOa(Q) contains a free subgroup with rank k, namely S k . 

Proof of Theorem^ For L wp (k), we define a rational GPFA Qk = ({si, S2, S3}, {A-aes}, vo, f), 
where 

1. 27= {Ri,... ,R k ,R[ 1 ,... is a basis of S k ; 

2. A CT = u for each a G 27; 

3. v = (1,0,0); 

4. f=(l,0,0) T . 

It is obvious that w 6 L wp {k) if and only if Ai . . . = ^3x3 if and only if fg k (w) = V0A1 . . . A w f = 1, 
where w S 27*. Thus, L wp (k) G Sr7 at by selecting the outpoint as 1. We can conclude with Fact 1431 □ 



5 Closure Properties 

The previously discovered closure properties of S > , and S = are listed below. 
Fact 51. 

1. S > is not closed under union and intersection |16|15|22] . 

2. S > is closed under union and intersection with a regular language |10|35] . 

3. S > is closed under reversal [36] . 

4. S > is not closed under concatenation, Kleene closure, and homomorphism [12|38| . 

5. S > is closed under complementation over unary alphabets |16j . 



Fact 52. 

1. Both and S are closed under union and intersection [32]. 

2. Neither nor S is closed under complementation [14j . 

3. S > is closed under intersection with a member of S 7 ^ |32| . 

We will prove several new nontrivial closure properties of the "one-sided" classes and S = . 
5.1 Dissimilar closure properties of and S = 

The proofs of the next few theorems use the capability of GPFA's to implement nondeterministic 
branching by just adding the transition matrices of the branches, and the nice properties of com- 
putation with one-sided cutpoint 0. 



Theorem 7. is closed under concatenation. 

Proof. IfLi,L2 G S^, then there exist two GPFA's Gi = (S' , U, {A^ ei: }, v , f ) and G2 = (S" , S, {A^ gi; }, v ' , f") 
such that L\ and L2 are recognized with one-sided cutpoint by Q\ and G2, respectively. Let n\ 
and 77,2 be the sizes of the state sets S\ and S2, respectively. 

We construct a new GPFA Q = (S, U, {A a& s}, vo, f ) recognizing L = L1L2 (concatenation of L\ 
and L2) with one-sided cutpoint 0. The details of Q are as follows: 

1. The size of S is n = n\ + 712; 

2. vq is a 1 x n row vector, 

(a) vo = (v I Vq) if e (empty string) belongs to L±, and 

(b) v = (vq j 0ixn 2 ) if e i- W, 

3. f is a n x 1 column vector, 

(a) f = ((f') T I (f") T ) T if eeL 2 , and 

(b) f=(0 lxni |(f") T ) T ife^L 2 ; 

4. {hacs} is the set ofnxu matrices, 



a; 




0rt2 Xni 


A^ 



(3) 



where X CT is an n\ x n 2 matrix, defined as 



v (l)A CT f 



V column 1 



Vo(2)A;f 
column 2 



a , , . i _/ 
v (n 2 )A CT f 

column n 2 / 



(4) 



The idea behind the construction is that for a given input string w G U* , each prefix of w, say, 
u G U* (w = uv), is checked for belonging to L\, and if so, the rest, v, is checked for belonging to 

Q simulates Q\ in the first n\ positions of its state vector. If Q\ accepts an input prefix u ending 
with a, the result of the multiplication between that 1 x n\ row vector describing the distribution 
after processing the first \u\ — 1 input symbols and the column vector A CT f , that is, fg 1 (u), will be 
positive. Otherwise, fg^u) = 0. By Equations [3] and HI the vector fg 1 (u).v will be added to the 
last tt-2 positions of CJ's state vector, meaning that Q2 will run on the remainder v of the input. 
The contribution of this branch of the computation to fg(w) is just the product of the value fg 2 {v) 
and the coefficient fg^u), and will be positive if both substrings are accepted by the respective 
machines, and zero otherwise. Since Q2 starts running in this manner in each step, its part of the 
overall state vector contains in general the sum of many 1 x n 2 vectors, multiplied by their respective 
coefficients, at any intermediate step of the computation. The cases where the empty string appears 
in L\ or L 2 are handled appropriately. 

In other words, 



and Q recognizes the concatenation of L\ with L 2 . 



□ 



Theorem 8. S~ is not closed under concatenation. 

Proof. L eq and {b} + are in S = , but L eq .b = L eq .{b} + is not, due to Theorem [5j 



□ 



Theorem 9. 5^ is closed under Kleene closure. 

Proof. If L E S*, then there exists a GPFA G = (S, U, {A aeS }, v , f) such that L U {e} is recognized 
by Q with one-sided cutpoint 0. Let n be the size of the state set S. 

We construct a new GPFA Q = (S, E, {A CTei; .}, vo, f ) recognizing L* (Kleene closure of L) with 
one-sided cutpoint 0. Each element of {A^. gi; } is defined as 



(5) 



where X a is an n x n matrix, defined as 

/ 

X„ = 



v (l)A CT f 



v (2)A CT f 



v (n)A CT f 



(6) 



For a given input string w G 17*, \w\ = I, 



fg' ( w ) = vo[h Wl + X Wl ] [A W2 + X^ 2 ] • • • [A Wl + X^Jf , 

and so 

fg>W = £ (n/o(«i)j, 

«i«2---Ufe=ui \i=l / 

where 1 < A; < / and each iij € 17*. Therefore, if u; can be divided, i.e., w = u± ■ ■ ■ Uk, such that 
each m G L (fg(ui) > 0), then fg>(w) > 0. On the other hand, if there is no such division, then 

fg>(w)=0. □ 



Lemma 5. Let L / = \w G {a, b\* I luiL + 1 = liok}. Then, L + , = Ln. 

y eg 

Proof. It is obvious that if iu G then w G L^. If w G L^, then 3fe G Z + such that = |ui| a +/c. 

eg 

Then, there must exist k + 1 indices, io = < ii < «2 < • • • < *fe = \w\, such that each prefix of w 
of length ij has j more 6's than a's, where 1 < j < k. In other words, to can be partitioned into k 
consecutive substrings, U\,U2, ■ ■ ■ ,U)~, satisfying 

1. w = u\U2 ■ ■ ■ Uk, and, 

2. \uj\ = ij — ij-i, that is, Uj begins with the (ij-i + l)th symbol of w and ends with the i,th 
symbol of w, where 1 < j < k. 

Since each it,- is a member of L /.we can conclude that w G L + ,. □ 

e <? eg 



Theorem 10. S" - is not closed under Kleene closure. 

Proof. It can be shown easily that L eq i is in S = . However, L*^,, which is Lu U {e} by Lemma [5l is 
not in S = due to Corollary [2j □ 



Lemma 6. Let h : 17 — > 17 \ {k} be a homomorphism such that 

h(o-) = { a > a ^ K , 
^£ , a = k 

where k is a specific symbol in 17. If L C 17* is in S^, i/ien so is h(L). 

Proof. Let £/ = (S, 17, {Ao-g^}, vo, f) be the GPFA recognizing L with one-sided cutpoint 0, and let 
17 = 17 \ {k}. For any w G h(L), there exists a u € L, such that /i(tt) = w, i.e., 

U = K C °WlK Cl ■ ■ ■ K^-l-W^K ^ 

for some nonnegative integer Cj's, where < i < |io|. In fact, we can bound all Cj's by a natural 
number, say n^, due to Fact UU Suppose that none of the strings in 

{k c ° Wi k C1 ■ ■ ■ k c m-i w , ik c i-i \ 0<i<\w\,0<Ci<n L } 



are members of L (so they are 
Then, by using Fact HTl 

* * 

* * * * r — 7" 

K W\K ■ ■ ■ K W\ W \K C L. 

We conclude that w ^ /i(L), which is a contradiction. 

Therefore, for any input string w £ (Z )*, we can simulate the computation of on some u's, 
where each Cj is guessed nondeterministically from the set {0, 1, • • • , ni, — 1}. The following matrix 
can be defined to implement the nondeterministic branching of the computation: 

X K = /+ £ A*. 

By embedding X K in a convenient way in the definition of <5, we can get the GPFA 

g' = (S,E',{A' aeS , =A CT X K },v X re ,f), 
which recognizes /i(L) with one-sided cutpoint 0. Hence, fg>(w) can be calculated as 

fg'(w)= fs( u ) 

u£{k c o Wi k c i -k c \ w \- 1 w\ w \ k°\ w \ } 

for the input string w £ (Z')*, where < Cj < ul — 1, and < i < |to|. Since the computation paths 
resulting in u ^ L produce /g(u) = 0, fg'(w) > is satisfied only when there is a computation path 
resulting in u £ L. □ 

Lemma 7. Lei /i : Z — > T* 6e a homomorphism such that \h(a)\ > /or a// <r £ Z. // L C Z* is 
in S^, i/ien so is h(L). 

Proof. Let £ = (5, Z, {Ao-g^}, vo, f) be the GPFA recognizing L with one-sided cutpoint 0. We will 
show that there exists a GPFA 

g' = (5',r,{A; er }, v ;,f') 

recognizing /i(L) with one-sided cutpoint 0. 

Q runs Q on a nondeterministically chosen input u = ui«2 ■ ■ ■ u \ u \ £ Z*, while checking whether 
its own input string w; matches h(u\)h(u2) ■ ■ ■ h(ui u i) or not. For each such nondeterministic com- 
putation path, we have the following cases: 

1. at least one of the matches fail, h(u) ^ w, then all entries representing the state vector of Q in 
this branch are set to zero, 

2. all substitutions succeed, h(u) = h{u\)h(u2) ■ ■ ■ h(u\ u \) = w, then 
(a) fg(u) = for u ^ L, and 



all members of L £ S ). 

• • k c ^- x w\ w \k c ^ C L, (1 < i < \w\,0 < Ci < ni) 
■ k c M-i<io| i/« c M C L, (2 < i < \w\, < Ci < Ul) 



(b) fg(u) > for u G L. 

fg/ (w) will again be denned as the summation over all computation paths, i.e., fg> (w) = Yl{u\h{u)=w} fG( u )- 
Hence, w G h(L) only if there is at least one successful substitution, f(u) = w, and u G L. 
The technical details of Q are as follows: 

1. For each a G U, we will use a separate 1 x |/i(<r)||/S| dimensional region in the state vector to 

)( )•••( ; 



V 



Cl 



<7_> 



It is easily formulated that \S \ = \S\ (Ylaes \^( a )\)- 
2. Each A^ €T is defined with respect to the separation above. 



/a: 



1£l 



V 



T \ 



T 



(8) 



7:0"|S| / 



Each T is almost a zero matrix, except a case which will be described below. 
3. Let a G E. Suppose that h(a) = 7172 • • • 7; G T* and Z > 0. Then, the region corresponding to 
a in Q 's state vector can be partitioned to I 1 x \S\ blocks: 



For 7 G T, A' can be partitioned into blocks of dimension |5| x \S\: If Z = 1, 



a 7 , ct = m) 



(9) 



(10) 



and if I > 1, 



A 



7.(7 



ri 



?2 



\ 



Ti- 



(11) 



/ 



4. 



where for 7, = 7, Tj = I, and Tj = otherwise, (0 < % < I); for 7; = 7, 7} = A^, and T; = 
otherwise. Additionally, for 7^ = 7, the bottom-leftmost blocks of all T's that are on the same 
row with A 7)CT in (JSj) are equal to A a ; for 72 7^ 7, all those blocks contain all 0's. 



/ 



(v I 0,--- ,0)(v I 0, • • • ,0) • • • (v I 0, • • • ,0) , 



V 



(To 



(f I 0, • • - ,0) (f I 0, - - - ,0) - - - (f I 0, 



.0) 



(12) 



(13) 



V 



When a 6 U is being guessed, where /i(<r) = 7172 • • • 7^ G T* and / > 0, the simulated state vector 
of Q is written in the first slot of ©. Whenever the matches succeed for 71, • • • , 7j_i, the simulated 
state vector of Q is transferred to the next slot in ([9]); in any other case, it is set to 0. When 7; is 
successfully substituted, this simulated state vector is updated as if Q has read the symbol a, and 
the result is transferred anew in the first slots of all the regions corresponding to symbols in U; 
otherwise, it is set to 0. □ 



Theorem 11. is closed under homomorph 



ism. 



Proof. Let h : £ — > T* be a homomorphism, L C E*, and L G S^. If h is a homomorphism of the 
form in Lemma [7J then the proof is complete. 

Otherwise, suppose that there are k > 1 symbols in 17, i.e., <ti,cj2, • • • ,cr k , such that h(ai) = e, 
where 1 < i < k. So, we can define k homomorphisms of the form in Lemma [6} 

hi : U Ui, where £1 = U \ {01} 
h 2 : £1 -> £2, where Z7 2 = Zi \ {a 2 } 



h k : £ k _i E kl where U k = H k ^i \ {a k }. 

Additionally, we define h k+ i : U k — > T*, where h k+ \{a) = h(a) for a G U k . Since h is the composi- 
tion of the /ij's (1 < i < k + 1), h(L) is also in S^ (Lemma [6] and UJ) . □ 



Theorem 12. S~ is not closed under homomorphism. 

Proof. Consider the languages L\ = {wicw2 \ w\ G L eq ,W2 G b + } and L 2 = {w\cui2 \ ui\ G L eq ,W2 G 
b*}. It is not hard to show that both languages are in S = . Let h\ and /12 be two homomorphisms 
defined as 

— h\{a) = a, h\{b) = b, h\{c) = e, and 

- h 2 (a) = a, h 2 (b) = b, h 2 (c) = b. 

L e qb = hi(Li) = ^2(^2)) and so S = is not closed under (e-free) homomorphism due to Theorem 
□ 

5.2 Common closure properties of and S= 



Theorem 13. and S= are closed under inverse homomorphism. 

Proof. Let h : U — > T* be a homomorphism, L C T*, and L G S^, such that the GPFA Q 
(S,T, {A 76 r}, vo,f) recognizes L with one-sided cutpoint 0. It is easily verified that 



g' = (5,r,{A^},v ,f), 



where 

a' _ / A «i ■ ■ ■ \h(«)\ ' /l ( CJ ) = «1 ■ ■ ' "Ifc^l ^ £ 



recognizes h 1 (L) with one-sided cutpoint 0. The same setup can be extended to any language in 
S=. □ 



Theorem 14. and S~ are closed under reversal. 

Proof. We use the same idea as [3U]. If L € S^, then there exists a GPFA Q = (S, E, {A ae x:}, vo, f) 
such that L is recognized by Q with one-sided cutpoint 0. It is easily seen that Q = (S, E, {Aj 6 ^}, f T , v 
recognizes the reverse of L with one-sided cutpoint 0. The same setup can be extended to any lan- 
guage in S = . □ 



Theorem 15. and S~ are closed under word quotient. 

Proof. If L G S^, then there exists a GPFA Q = (S, S, {A aG x;}, vo, f) such that L is recognized by 
Q with one-sided cutpoint 0. For any given w € £*, 

1. GPFA Q\ = (S, E, {A ae s}, voA Wl • • • A w , w , ,f) recognizes the language {y \ wy € L] with one- 
sided cutpoint 0; 

2. GPFA Q2 = (S, U, {Ao-gx-}, vo, A Wl •••A u ,, TO ,f) recognizes the language {z | zw € L} with one- 
sided cutpoint 0. 

The same setup can be extended to any language in S = . □ 



Theorem 16. and S~ are not closed under difference. 
Proof. There exists an L € such that L ^ . 

1. S* and L are in S^, but E* \ L = L is not; 

2. U* and L are in S = , but U* \ L = L is not. 

□ 



Theorem 17. and S~ are closed under difference with a regular language. 

Proof. Regular languages are closed under complementation, and and S = are closed under in- 
tersection. □ 

For completeness, we list below the following easy facts about MCL and NMCL: 

1. NMCL is closed under both union and intersection. 

2. Neither MCL nor NMCL is closed under complementation [6]. 

3. Both MCL and NMCL are closed under inverse homomorphism [28]. 

4. Both MCL and NMCL are closed under word quotient [9]. 



6 Concluding Remarks 



In this paper, we gave a full characterization of the class of languages recognized by all QFA models 
which are at least as powerful as the Kondacs-Watrous QFA with cutpoint 0. This is the only known 
case where the language recognition power of one-way QFA's has been proven to be strictly greater 
than that of their probabilistic counterparts^. The superiority of QFA's over PFA's with regard to 
space efficiency in this setting was demonstrated. We also examined the limitations of recognition 
with one-sided error for these models. Several new closure properties of the related classes and 
S = were proven. 

The relationship between nondeterministic quantum complexity classes and counting classes has 
been studied in detail. It is known that NQP = co-C=P [56]. More relevantly for our work, Watrous 
[40] has shown that NQSPACE(s) = co-C=SPACE(s) for s = J?(log(n)). Note that the subset of 
defined using PFA's that only contain efficiently computable transition probabilities and cutpoint 
\ equals co-C=SPACE(l), so we have proverb that co-C=SPACE(l) C NQSPACE(l), and whether 
the inclusion is strict or not depends on whether a two-way head would increase the computational 
power of a NQEA0. 

The superiority of NQFA's over classical NFA's has ramifications about relationships among 
classical and quantum nondeterministic space complexity classes for all sublogarithmic bounds. 
Although the following result follows from a combination of previously known facts, we have not 
seen it stated anywhere: 



Theorem 18. NSPACE(s) C NQSPACE(s) for s = o(log(n)). 

Proof. (Sketch.) Quantum Turing machines can simulate probabilistic Turing machines easily for 
any common space bound [UJ. There exists a NQFA with efficiently computable amplitudes (i.e. a 
constant-space nondeterministic quantum Turing machine) which recognizes the language L neq = 
{w £ {a, b}* | |w;| a ^ \w\b} [6]. It is easily seen that L neq is a nonregular deterministic context-free 
language (DCFL). It is known that no nonregular DCFL is in NSPACE(s) for s = o(log(n)) [TJ. □ 

For space bounds s € i?(log(n)), all we know in this regard is the trivial fact that NSPACE(s) 
C NQSPACE(s) 

The succinct QFA models alluded to in Section 3.3 form the basis of a demonstration [44] of 
the fact that two-way QFA's can have a similar state complexity advantage over both their one-way 
versions, and two-way classical nondeterministic automata. 

One important QFA variant that was not considered in this paper is the Latvian QFA [2] , which 
is a generalization of the MCQFA not thought to be as powerful as the KWQFA. An examination 
of the corresponding classes for this model would be interesting. 

Some other open questions related to this work are listed below. 



From a "pedagogical" point of view, this seems to us to be one of the simplest setups in which a quantum compu- 
tational model can be demonstrated to outperform the corresponding probabilistic model. 

9 All our proofs stand when the transition probabilities and amplitudes are restricted to be efficiently computable 
numbers, as mentioned in [5] . We can in fact prove that the collection of languages recognized by the most general 
model of NQFA's [3T] is precisely the class Sf at (see Definition I12p when all the amplitudes of the NQFA are 
restricted to be rational numbers. 

For any two-way PFA M and cutpoint Ai £ [0, 1), there exist a one-way PFA V and a cutpoint A2 £ [0, 1) such that 
(Ai, Ai) = ("P, A2) [19] . whereas two-way QFA's are more powerful than one-way QFA's in the general unbounded 
error setting [43| . 



Open 



Problem 1. 



Is MCL closed under union? Intersection? 



Open 



Problem 2. 



Do NMCL and MCL coincide? 



Open 



Problem 3. 



Does S^fl S contain a nonregular language? 



Open Problem 4. Is countable or uncountable? 



Open Problem 5. Is S > closed under complementation? (page 158 of [32]) 



Open Problem 6. Is S= a subset of S>? (page 173 of [32]) 

Open Problem 7. Can NQFA's with a two-way tape head recognize more languages than the 
one-way model discussed here? 
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